Deterministic natural language generation from meaning representations for machine translation
نویسنده
چکیده
This paper describes a deterministic method for generating natural language suited to being part of a machine translation system with meaning representations as the level for language transfer. Starting from Davidsonian/Penman meaning representations, syntactic trees are built following the Penn Parsed Corpus of Modern British English, from which the yield (i.e., the words) can be taken. The novel contribution is to highlight exploiting the presentation of meaning content to inform decisions regarding the selection of language constructions: active vs. passive, argument subject vs. expletive it vs. existential there, discourse vs. intra-sentential coordination vs. adverbial clause vs. participial clause vs. purpose clause, and infinitive clause vs. finite clause vs. small clause vs. relative clause vs. it cleft.
منابع مشابه
Generation by Inverting a Semantic Parser that Uses Statistical Machine Translation
This paper explores the use of statistical machine translation (SMT) methods for tactical natural language generation. We present results on using phrase-based SMT for learning to map meaning representations to natural language. Improved results are obtained by inverting a semantic parser that uses SMT methods to map sentences into meaning representations. Finally, we show that hybridizing thes...
متن کاملGeneration From Lexical Conceptual Structures
This paper describes a system for generating natural language sentences from an interlingual representation, Lexical Conceptual Structure (LCS). This system has been developed as part of a Chinese-English Machine Translation system, however, it promises to be useful for many other MT language pairs. The generation system has also been used in CrossLanguage information retrieval research (Levow ...
متن کاملMeaning Representation for Knowledge Sharing in Practical Machine Translation
Knowledge-based machine translation can be viewed as the problem of extracting and representing the meaning of a text and generating a translation in a target language using the meaning representation. Meaning extraction requires the integration of information present explicitly in a text with common sense and domain knowledge given to the system. Thus, integrating linguistic knowledge of each ...
متن کاملLearning for Semantic Parsing Using Statistical Machine Translation Techniques
Semantic parsing is the construction of a complete, formal, symbolic meaning representation of a sentence. While it is crucial to natural language understanding, the problem of semantic parsing has received relatively little attention from the machine learning community. Recent work on natural language understanding has mainly focused on shallow semantic analysis, such as word-sense disambiguat...
متن کاملThe aNALoGuE Challenge: Non Aligned Language GEneration
We propose a shared task based on recent advances in learning to generate natural language from meaning representations using semantically unaligned data. The aNALoGuE challenge aims to evaluate and compare recent corpus-based methods with respect to their scalability to data size and target complexity, as well as to assess predictive quality of automatic evaluation metrics.
متن کامل